Comment on: "Solving the riddle of the bright mismatches: Labeling and effective 

binding in oligonucleotide arrays" 
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In a recent paper [Phys. Rev. E 68, 011906 (2003)], Naef and Magnasco suggested that the 
"bright" mismatches observed in Affymetrix microarray experiments are caused by the fluorescent 
molecules used to label RNA target sequences, which would impede target-probe hybridization. 
Their conclusion is based on the observation of "unexpected" asymmetries in the affinities obtained 
by fitting microarray data from publicly available experiments. We point out here that the observed 
asymmetry is due to the inequivalence of RNA and DNA, and that the reported affinities are 
consistent with stacking free energies obtained from melting experiments of unlabeled nucleic acids 
in solution. The conclusion of Naef and Magnasco is therefore based on an unjustified assumption. 

PACS numbers: 87.15.-v,82.39.Pj 



d 
•I— I 

I 



> 
m 
(N 
O 

o 
o 



X 



In a recent paper [H Naef and Magnasco investigated 
the problem of bright mismatches (MMs) in Affymetrix 
DNA microarrays. In these arrays a perfect matching 
(PM) 25 nucleotides probe is accompanied by a MM one 
in which a single nucleotide in the central position is mod- 
ified. The MM probes in Affymetrix chips are introduced 
with the purpose of estimating the contributions from 
non-specific hybridization. One of the problems with this 
approach is that of the so-called "bright" mismatches, for 
which the fluorescence intensity measured from a MM 
probe is higher than that from the corresponding PM 
probe. This is seemingly at odds with basic thermody- 
namics, as a perfectly matching duplex is more stable 
than one containing a mismatch. The analysis of exper- 
imental data shows that the occurrence of bright mis- 
matches is rather frequent. It was found to occur in 30% 
of the probes in the Affymetrix Human IIGU95a microar- 
ray i. 

The authors of Ref. 1] analyze a series of microarray 
experiments performed by Affymetrix. In this set of ex- 
periments the targets are single stranded RNA molecules 
where some of the pyrimidines (the U and C bases) 
carry a biotin, while the probes are single stranded DNA 
molecules. After the hybridization step is terminated the 
solution containing the non-hybridized target is washed 
off from the array. Fluorescent labels, attached to strep- 
tadivin molecules, are then added. The biotin is a strong 
binding site for the streptadivin during this staining step. 
Although Naef and Magnasco use "biotin" and "fluores- 
cent label" as synonyms in Ref. |lj, it is important to 
emphasize that there are no fluorescent labels during the 
hybridization step in Affymetrix experiments. 

In Ref. (ij the brightness of PM probes is fitted using 
affinities Ai^i in which I — A,C,T or G denotes the base 
type and « = 1, 2 ... 25 the position along the probe. The 
resulting fit-values are plotted in their Fig. 3. Notably, 
there are differences between affinities for A,T and C,G 
bases, which were interpreted in Ref. as as due to 



TABLE I: The stacking free energy parameters AG37 for 
RNA/DNA hybrids measured in solution at a salt concen- 
tration 1 M NaCl and T = 37° 3]. The upper strand is RNA 
(with orientation 5'-3') and lower strand DNA (orientation 
3'-5'). 



Sequence 


-AG37(kcal/mol) 


Sequence 


-AG37(kcal/mol) 


rAA 
dTT 


1.0 


rAG 
dTG 


2.1 


rAG 
dTC 


1.8 


rAU 
dTA 


0.9 


rCA 
dGT 


0.9 


rCC 
dGG 


2.1 


rCG 
dGC 


1.7 


rCU 
dGA 


0.9 


rGA 
dCT 


1.3 


rGC 
dCG 


2.7 


rGG 
dCC 


2.9 


rGU 
dCA 


1.1 


rUA 

dAT 


0.6 


rUC 
dAG 


1.5 


rUG 
dAC 


1.6 


rUU 
dAA 


0.2 



the presence of biotin in some bases of the RNA strand. 
We quote from Ref. : "... An unexpected aspect of 
the above fits is the asymmetry of A versus T (and G 
versus C) affinities, which goes against the zeroth order 
energetic consideration that A-T and T-A bonds (or G-C 
and C-G) would contribute equally to the binding . . . The 
obvious culprits are the fluorescent labels . . . " . 

We strongly disagree with this interpretation of the 
data. There is no "symmetry" between A-T and T-A, 
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TABLE II: Column 2: Single nucleotide free energies ob- 
tained from Eq. Q expressed in kcal/mole. Column 3: The 
same free energies to which the average value is subtracted. 
Column 4: The log 10 affinities derived from the data of col- 
umn Shy Aj = {{AG) - AGj)/{RTlnW), with RT = 0.63 
kcal/mole. Column 5: Effective affinities for the middle bases 
as given in the Fig. 3 of Ref. 
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-AG., 


(AG) - AG., 




Ref. fl] 


C 


4.00 


1.09 


0.75 


0.20 


G 


3.50 


0.59 


0.40 


0.02 


T 


2.40 


-0.51 


-0.35 


-0.01 


A 


1.75 


-1.16 


-0.80 


-0.20 



since one nucleotide is part of an RNA strand and the 
other of DNA. To start with, RNA does not have a T 
(thymine) but a U (uracil), and the authors compare an 
A-T binding with a U-A binding; and although the usual 
naming of a C-G and a G-C binding suggests symmetry, 
also this symmetry is broken by the different backbones 
of RNA and DNA. 

The thermodynamics of RNA/DNA duplexes in so- 
lution has been investigated in a series of experiments 
in which the melting temperatures of short duplexes, of 
about 20 nucleotides, are measured (see e.g. Q). These 
measurements provide estimates of the differences in en- 
thalpy AH and entropy AS" between a duplex and the 
two separate strands. As for other types of duplexes, 
e.g. DNA/DNA and RNA/RNA, it turns out that AH 
and AS" can be well approximated by sums of sequence 
dependent local terms taking into account the contribu- 
tion of hydrogen bonds and stacking interactions between 
neighboring bases. As a consequence the free energy dif- 
ference, or hybridization free energy, AG ~ AH — TAS 
is also well approximated by a sum of local terms. The 
latter are given in Table H . Note the asymmetries 
between the free energy parameters when interchanging 
nucleotides between DNA and RNA strands. 

In order to compare the hybridization free energies in 
solution with the affinities reported in Ref. [H , which are 
single nucleotide dependent, we fix a nucleotide on the 
probe strand and average the values in Table over all 



possible neighbors in the 3' and 5' direction. For instance 
for a T in the DNA strand we define: 

're{A,T,G,c} '- ^ ' ^ \ ' / J 

where 7' is the nucleotide in the RNA strand comple- 
mentary to 7. These free energies and the corresponding 
binding affinities are given in Table ITTI 

Note that Eq. gives lower affinities for A compared 
to T and for G compared to C, in qualitative agreement 
with the data of Ref. Q in Table ITIl While in Ref. Q 
these differences were argued to prove that biotin affects 
the binding, our analysis clearly shows that these differ- 
ences (or asymmetries as referred to in PJ) are intrinsic 
properties of unbiotinylated RNA/DNA duplexes in so- 
lution. 

It is not surprising that the effective affinities mea- 
sured in Ref. Q are smaller than the binding free ener- 
gies obtained from Eq. 111). The affinities of Ref. [l| 
are obtained by fitting the measured ffuorescent signals 
of the microarray to a Langmuir model (Eq. (1) in 11). 
The fluorescence measured in the microarray experiment 
is not solely determined by the binding free energy be- 
tween an isolated probe and a specific target, but it is also 
influenced by many other effects like polydispersity in 
probe and target lengths, secondary structure formation 
in probes and targets, and hybridization between targets 
in solution. We thus do not expect that the affinities of 
Ref. should agree quantitatively with the binding free 
energies in solution. As the neglected processes compete 
with the hybridization of a probe with a complementary 
target, it is to be expected that the difference in effec- 
tive affinities of Ref. are lower than their solution 
counterparts. 

Certainly, Ref. Q does not show that fluorescent la- 
bels (or, to be precise, the biotin linker) interfere with 
binding, or are the cause of bright mismatches. 
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